Inverse optimal control from incomplete trajectory observations
نویسندگان
چکیده
This article develops a methodology that enables learning an objective function of optimal control system from incomplete trajectory observations. The is assumed to be weighted sum features (or basis functions) with unknown weights, and the observed data segment states inputs. proposed technique introduces concept recovery matrix establish relationship between any available weights given candidate features. rank indicates whether subset relevant can found among corresponding learned data. obtained iteratively its non-decreasing property shows additional observations may contribute learning. Based on matrix, method for using learn selected established, incremental inverse algorithm developed by automatically finding minimal required observation. effectiveness demonstrated linear quadratic regulator simulated robot manipulator.
منابع مشابه
Inverse Optimal Control with Incomplete Observations
In this article, we consider the inverse optimal control problem given incomplete observations of an optimal trajectory. We hypothesize that the cost function is constructed as a weighted sum of relevant features (or basis functions). We handle the problem by proposing the recovery matrix, which establishes a relationship between available observations of the trajectory and weights of given can...
متن کاملFrom Inverse Kinematics to Optimal Control
Numerical optimal control (the approximation of an optimal trajectory using numerical iterative algorithms) is a promising approach to compute the control of complex dynamical systems whose instantaneous linearization is not meaningful. Aside from the problems of computation cost, these methods raise several conceptual problems, like stability, robustness, or simply understanding of the nature ...
متن کاملInverse Optimal Control
In Reinforcement Learning, an agent learns a policy that maximizes a given reward function. However, providing a reward function for a given learning task is often non trivial. Inverse Reinforcement Learning, which is sometimes also called Inverse Optimal Control, addresses this problem by learning the reward function from expert demonstrations. The aim of this paper is to give a brief introduc...
متن کاملReceding Horizon Control with Incomplete Observations
supported by the Fonds zur Förderung der wissenschaftlichen Forschung under SFB 03 ,,Optimierung und Kontrolle " .
متن کاملApproximate MaxEnt Inverse Optimal Control
Maximum entropy inverse optimal control (MaxEnt IOC) is an effective means of discovering the underlying cost function of demonstrated agent’s activity. To enable inference in large state spaces, we introduce an approximate MaxEnt IOC procedure to address the fundamental computational bottleneck stemming from calculating the partition function via dynamic programming. Approximate MaxEnt IOC is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The International Journal of Robotics Research
سال: 2021
ISSN: ['1741-3176', '0278-3649']
DOI: https://doi.org/10.1177/0278364921996384